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Message-passing iterative decoders for low-density parity-check (LDPC) block codes are 
known to be subject to decoding failures due to so-called pseudo-codewords. These failures can 
cause the large signal-to-noise ratio performance of message-passing iterative decoding to be 
worse than that predicted by the maximum-likelihood decoding union bound. 

In this paper we address the pseudo-codeword problem from the convolutional-code per- 
spective. In particular, we compare the performance of LDPC convolutional codes with that of 
their "wrapped" quasi-cyclic block versions and we show that the minimum pseudo-weight of 
an LDPC convolutional code is at least as large as the minimum pseudo- weight of an underlying 
quasi-cyclic code. This result, which parallels a well-known relationship between the minimum 
Hamming weight of convolutional codes and the minimum Hamming weight of their quasi-cyclic 
counterparts, is due to the fact that every pseudo-codeword in the convolutional code induces a 
pseudo-codeword in the block code with pseudo- weight no larger than that of the convolutional 
code's pseudo-codeword. This difference in the weight spectra leads to improved performance 
at low-to-moderate signal-to-noise ratios for the convolutional code, a conclusion supported by 
simulation results. 
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Figure 1: Left: the Tanner graph associated with the parity-check matrix H that was defined in 
(jl.lj) . Right: (1:1:0, 1:1:0, 1:1:0) is a valid codeword in a cubic cover. 



1 Introduction 

Although low-density parity-check block codes (LDPC-BCs) have very good performance under 
message-passing iterative (MPI) decoding, they are known to be subject to decoding failures due 
to so-called pseudo-codewords. These are real-valued vectors that can be loosely described as error 
patterns that cause non-convergence in iterative decoding due to the fact that the algorithm works 
locally and can give priority to a vector that fulfills the equations of a graph cover rather than the 
graph itself. 

Example 1.1. Consider the trivial length-3 and dimension-0 code C = {(0,0,0)} defined by the 
3x3 parity-check matrix 
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1 1 1 
1 1 



(1.1) 



and whose Tanner graph is shown in Fig. ^ (left). A possible cubic cover of this Tanner graph is 
also depicted in Fig. U (right). Because (1:1:0, 1:1:0, 1:1:0) is a valid configuration in this cubic 
cover, the vector u = (|, |, |) is a pseudo-codeword of the code C defined by H (see [2,3]). □ 

It has been shown [2, 4, 5] that the performance of MPI decoding schemes for LDPC-BCs 
is largely dominated not by minimum Hamming weight considerations but by minimum pseudo- 
weight considerations, where the minimum pseudo-weight in the case of an additive white Gaussian 

a IMI 2 

noise channel (AWGNC) is defined as u>™ m (H) = min^e-p f, — where II ■ |h and II ■ IU are, respec- 

tively, the 1-norm and 2-norm, and V is the set of all pseudo-codewords for the code C defined 
by H. The minimum pseudo- weight is a measure of the effect that decoding failures have on the 
performance of the code. 1 As a consequence, the large signal-to-noise ratio (SNR) performance of 
MPI decoding can be worse than that predicted by the maximum-likelihood decoding union bound, 
which constitutes a major problem when trying to determine performance guarantees. Addressing 
this problem from the convolutional-code perspective, i.e., studying the pseudo-codeword problem 
described above for LDPC convolutional codes (LDPC-CCs), constitutes the major topic of this 
paper. 

We investigate a class of time-invariant LDPC convolutional codes derived by "unwrapping" 
certain classes of quasi-cyclic (QC) LDPC block codes that are known to have good performance 

1 Since w™ m (H) < -^^'"((I), the minimum Hamming weight may still be a key factor in performance analysis; 
however, the larger the gap between w™ m (H) and tUH ln (Q, the greater role the minimum pseudo- weight plays. In 
any case, the minimum Hamming weight is important for quantifying the impact of undetectable errors. 
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Figure 2: The MPI decoding performance of a rate R = 2/5 (3, 5)-regular LDPC convolutional 
code and three associated (3, 5)-regular QC-LDPC block codes. (Maximum of 50 iterations.) 

[6-8]. Unwrapping a QC block code to obtain a time-invariant convolutional code represents a 
major link between QC block codes and convolutional codes. This link was first introduced in a 
paper by Tanner [9], where it was shown that the free distance of the unwrapped convolutional 
code, if non-trivial, cannot be smaller than the minimum distance of the underlying QC code. This 
idea was later extended in [10,11]. More recently, a construction for LDPC convolutional codes 
based on QC-LDPC block codes was introduced by Tanner et al. [6,7], and a sliding- window MPI 
decoder was described. In that paper it was noted that the (non-trivial) convolutional versions 
of these codes significantly outperformed their block code counterparts in the waterfall region of 
the bit error rate (BER) curve, even though the graphical representations of MPI decoders were 
essentially equivalent. 

Throughout this paper we mainly take the approach of [2,3, 14], which connects the presence 
of pseudo-codewords in MPI decoding and linear programming (LP) decoding. LP decoding was 
introduced by Feldman, Wainwright, and Karger [15, 16] and can be seen as a relaxation of the 
maximum-likelihood decoding problem. More precisely, maximum-likelihood decoding can be for- 
mulated as the solution to an optimization problem, where a linear cost function is minimized 
over a certain polytope, namely the polytope that is spanned by the set of all codewords. For 
general codes, there is no efficient description of this polytope and so Feldman et al. suggested 
replacing it with an (efficiently describable) relaxed polytope, which in the following will be called 
the "fundamental polytope". In other words, the decoding result of the LP decoder is the point 
in the fundamental polytope that minimizes the above-mentioned linear cost function. 

In order to analyze the behavior of unwrapped LDPC convolutional codes under LP decoding, 
we need to examine the fundamental polytope [2, 3] of the underlying QC-LDPC block codes. 
(Because of symmetries, it is actually sufficient to study the structure of the fundamental polytope 
around the zero codeword, i.e., it is sufficient to study the so-called fundamental cone.) Our goal 
is to formulate analytical results (or at least efficient procedures) that will allow us to bound the 
minimum pseudo-weight of the pseudo-codewords of the block and convolutional codes. 

The paper aims at addressing this question and related issues. In the following sections, we will 
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Figure 3: Left: Number of iterations needed for a sum-product-algorithm-type MPI decoder to 
decide for the all-zero codeword. (1000 corresponds to no convergence.) Here the j3 values 1.00, 
2.00, 3.00, 4.00, 5.00 correspond to signal-to-noise ratios E^/Nq of, respectively, —2.04 dB, 0.96 dB, 
2.73 dB, 3.97 dB, 4.94 dB. Right: Number of iterations needed for a min-sum-algorithm-type MPI 
decoder to decide for the all-zero codeword. (2000 corresponds to no convergence.) We did not 
observe convergence to the all-zero codeword for a > 0.50 either for the sum-product-algorithm- 
type MPI decoder or for the min-sum-algorithm-type MPI decoder. 

study the connections that exist between pseudo-codewords in QC codes and pseudo-codewords in 
the associated convolutional codes and show that this connection mimics the connection between 
the codewords in QC codes and the associated convolutional codes. 

1.1 Motivational Example 

As a motivational example we simulated a rate R = 2/5 (3, 5)-regular LDPC-CC with syndrome 
former memory m s = 21, together with three wrapped block code versions: a [155,64] (3,5)- 
regular QC-LDPC block code, a [200, 82] code, and a [240, 98] code, with parity-check matrices of 
increasing circulant sizes r = 31, r = 40, and r = 48, respectively, while keeping the same structure 
within each r x r circulant [1] (see Section |2.1|) . (Note that increasing the circulant size of the 
QC code increases its complexity, i.e., its block length. Also note that each of the three block 
codes has rate slightly greater than 2/5.) A sliding- window MPI decoder as in [6] was used to 
decode the convolutional code. Conventional LDPC-BC MPI decoders were employed to decode 
the QC-LDPC block codes. All decoders were allowed a maximum of 50 iterations. The resulting 
BER performance of these codes on a binary-input AWGN channel is shown in Fig. |2j 

We note that, particularly in the low-to-moderate SNR region, where the complete pseudo- 
weight spectrum plays an important role, the unwrapped LDPC-CC performs between 0.5 dB and 
1.0 dB better than the associated QC-LDPC block codes. Also, as the circulant size increases, 
the performance of the block codes approaches that of the convolutional code. These performance 
curves suggest that the pseudo-codewords in the block code that result in decoding failures may 
not cause such failures in the convolutional code, which suggests that LDPC-CCs may have better 
iterative decoding thresholds than comparable LDPC-BCs (see also [12] and [13]). 

In order to underline the influence of pseudo-codewords under MPI decoding, we consider the 
following experiment. Let u be a minimal pseudo-codeword [3] for the above-mentioned R = 2/5 
(3, 5)-regular LDPC-CC, i.e., a pseudo-codeword that corresponds to an edge of the fundamental 
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cone of that LDPC-CC. Moreover, we define the log-likelihood ratio vector A(a,/3) to be 2 

\(a,0) = (3-{l- 2a|^jj|U>J . 

We then run the MPI decoder that is initialized with A = A (a, (3) and count how many iterations 
it takes until the decoder decides for the all-zero codeword as a function of a and (3. The results 
are shown in Fig. |31 

The meaning of A(a, j3) is the following. If a = 0, then A corresponds to the log-likelihood 
ratio vector that the receiver sees when the communication system operates at a signal-to-noise 
ratio E^/Nq = (3/{AR) and when the noise vector that is added by the binary-input AWGN 
channel happens to be the all-zero vector (see, e.g., the discussion in [3, Sec. 3]). For non-zero 
a the expression for A(a, f3) has been set up such that the LP decoder has a decision boundary 
at a = 0.5: for a < 0.5 the all-zero codeword wins against the pseudo-codeword u whereas for 
a > 0.5 the all-zero codeword loses against the pseudo-codeword u under LP decoding. 

The simulations in Fig. |3] were obtained using a search algorithm that looked for a low-pseudo- 
weight minimal pseudo-codeword u in the fundamental cone of the above-mentioned LDPC-CC. 
The u that we found has AWGNC pseudo- weight 18.1297, which happens to be smaller than the 
free distance. Secondly, we ran the MPI decoder for various choices of a and (3: Fig.|3]Left shows the 
number of iterations needed using a sum-product-algorithm- type MPI decoder whereas Fig.|HlRight 
shows the number of iterations needed using a min-sum-algorithm-type MPI decoder. (Note that 
the decisions reached by the latter are independent of the choice of (3, (3 > 0.) Because of the more 
oscillatory behavior of the min-sum-algorithm-type MPI decoder close to decision boundaries of 
the LP decoder, observed empirically, it is advantageous to run that decoder for many iterations in 
our scenario, whereas in the case of the sum-product-algorithm-type MPI decoder it hardly pays 
to go beyond 150 iterations. 

1.2 Paper Goals and Structure 

In this paper, we provide a possible explanation for the performance difference observed in the 
motivational example above. Based on the results of [2, 3] that relate code performance to the 
existence of pseudo-codewords, we examine the pseudo-codeword weight spectra of QC-LDPC 
block codes and their associated convolutional codes. We will show that for a non-trivial LDPC- 
CC derived by unwrapping a non- trivial QC-LDPC block code, 3 the minimum pseudo-weight of 
the convolutional code is at least as large as the minimum pseudo-weight of the underlying QC 
code [1], i.e., 

< n (H«) < < n (H conv ) . 

This result, which parallels the well-known relationship between the free Hamming distance 
of non-trivial convolutional codes and the minimum Hamming distance of their non-trivial quasi- 
cyclic counterparts [9], is based on the fact that every pseudo-codeword in the convolutional 
code induces a pseudo-codeword in the block code with pseudo-weight no larger than that of the 
convolutional code's pseudo-codeword. This difference in the weight spectra leads to improved 
BER performance at low-to-moderate SNRs for the convolutional code, a conclusion supported by 
the simulation results presented in Fig. [2j 

2 Note that the numerator in the ratio |[w||i/||w||a is not squared, therefore this ratio does not correspond to the 
AWGNC pseudo- weight of uj (see Sec. [3J , although it is closely related to that value. 

3 Non-trivial means here that the set of pseudocodewords contain non-zero pseudo-codewords. 
4 Non-trivial means here that the set of codewords contain non-zero codewords. 
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The paper is structured as follows. In Sec. [21 we develop the background necessary to describe 
the connection between pseudo-codewords in unwrapped LDPC convolutional codes and those 
in the associated QC-LDPC codes. Thus, in Sec. 12.11 we briefly discuss the connection between 
convolutional codes and their associated QC codes, especially how codewords in the former can be 
used to construct codewords in the latter, and in Sec. 12.21 we define the fundamental polytope/cone 
of a matrix and show how we can describe the fundamental cone of a polynomial parity-check 
matrix through polynomial inequalities. We end the section by showing how pseudo-codewords in 
unwrapped LDPC convolutional codes yield pseudo-codewords in the associated QC-LDPC codes. 
In Sec. [3 we compare the pseudo-weights of unwrapped convolutional and their associated QC 
block codes. Sec. 13.11 introduces various channel pseudo-weights and Sec. 13.21 presents the main 
result, namely that the minimum AWGN pseudo- weight, the minimum binary erasure channel 
(BEC) pseudo- weight, the minimum binary symmetric channel (BSC) pseudo- weight, and the 
minimum max-fractional weight of a convolutional code are at least as large as the corresponding 
minimum pseudo-weights of a wrapped QC block code. Sec. 0] discusses a method to analyze 
problematic pseudo-codewords, i.e., pseudo-codewords with small pseudo- weight. The method 
addresses the convolutional code case. It introduces two sequences of "truncated" pseudo-weights 
and, respectively, "bounded pseudo-codeword" pseudo-weights, that play an important role in 
identifying the minimum pseudo-weight of the convolutional code, similar to the role that column 
distances and row distances play in identifying the free distance. We end with some conclusions 
in Sec. [51 

1.3 Notation 

Throughout the paper we will use the standard way of associating a Tanner graph with a parity- 
check matrix and vice- versa. We will use the following notation. We let F2, K, R+, and M++ 
be the Galois field of size 2, the field of real numbers, the set of non-negative real numbers, and 
the set of positive real numbers, respectively. If a(X) is a polynomial over some field and r is 
some positive integer then (a(X) mod (X r — 1)) denotes the polynomial b(X) of degree smaller 
than r such that b(X) = a(X) (mod (X r — 1)). We say that a polynomial u>i(D) = Yli^t,^ 1 
with real coefficients is non-negative, and we write toi(D) > 0, if all its coefficients uj^ satisfy 
> 0. Similarly, a polynomial vector u){D) = (ui\(D), u)2(D), ■ ■ ■ ,u>l(D)) is non-negative, and 
we write w(D) > 0, if all its polynomial components toi(D) satisfy coi(D) > for all £ € {1, . . . , L}. 
Moreover, a polynomial matrix A(D) is non-negative, and we write A(-D) > 0, if all its entries are 
non-negative polynomials. Finally, for any positive integer r and for any r' £ {0, 1, . . . , r — 1}, I r / 
will represent the r'-times cyclically left-shifted identity matrix of size r x r. 

2 Pseudo-Codewords for QC and Convolutional Codes 

We begin this section by presenting an important link between QC block codes and convolutional 
codes that was first introduced in a paper by Tanner [9] and later extended in [10, 11]. Similar 
to the connection between codewords in an unwrapped convolutional code and codewords in the 
underlying QC code, we will show that pseudo-codewords in the convolutional code give pseudo- 
codewords when projected onto an underlying QC code. 

2.1 A Link Between QC Block Codes and Convolutional Codes 

In this section we introduce the background needed for the later development of the paper. Note 
that all codes will be binary linear codes. 
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Any length n = r ■ L quasi-cyclic (QC) code Cqc — Cqq with period L can be represented 

(r) 

by an rJ x rL scalar block parity-check matrix Hq^ that consists of J x L circulant matrices of 
size r x r. Using the isomorphism between the ring of binary circulant matrices of size r x r and 
the ring of polynomials ¥2[X]/ (X r — 1} of degree less than r, we can associate with the scalar 
parity-check matrix Hqq the polynomial parity-check matrix Hqq(X) € (¥2[X]/ (X r — l) s j JxL , 
with polynomial operations performed modulo X r — 1. Due to the existence of this isomorphism, 
we can identify two descriptions, scalar and polynomial, and use either of the two depending on 
their usefulness. 

Example 2.1. The cubic cover C^ 3 ^ of the trivial code in depicted in Fig. ^ is a quasi-cyclic 
code 5 with scalar parity-check matrix Hqq and corresponding polynomial parity-check matrix 

Hgl(X) G (F 2 [X}/(X r - l)) 3x3 (r = 3) given by 



H 



(3) 
QC 



1 1 

1 1 

1 1 

1 1 1 

10 10 10 

1 1 1 
1 1 
1 1 
1 1 



Io Io 
I 2 Io Ii 
I Io 



1 1 
X 2 1 X 
1 1 



(2.1) 



□ 



(r) 

By permuting the rows and columns of the scalar parity-check matrix Hqq (i- e -; by taking 
the first row in the first block of r rows, the first row of the second block of r rows, etc., then 
the second row in the first block, the second row in the second block, etc., and similarly for the 

— (r) (r) 

columns), we obtain the parity-check matrix Hq C of a code that is equivalent to Cqq. The scalar 

— (r) 

parity-check matrix Hq C has the form 



H 



(r) 
QC 



H 
Hi 



H r _2 

H r _i 



H r _i 
H 



H r _3 
H r _2 



H 2 
H 3 



H 
Hi 



Hi 
H 2 



H r _i 
H 



where the scalar J x L-matrices Ho, Hi, . . . , H r _i satisfy Hqq(X) = H0 + H1X + • • • + H r _iX r 1 . 
Example 2.2. The code in Example has H^(X) = H + HiX + tt 2 X 2 , where 

(2.2) 
□ 





"1 


1 


0" 




"0 





0" 




"0 





0" 







1 





,Hi^ 








1 




1 













1 


1 

























3 Not all covers are QC codes. 
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Given the polynomial parity-check matrix of a QC-code, it is easy to see the natural connection 
that exists between quasi-cyclic codes and convolutional codes (see, e.g., [7,9-11,17]). Briefly, with 

( r ) 

any QC block code Cqq of length r ■ L, given by a J x L polynomial matrix parity-check matrix 



Hg^(X) = H + HiX + • • • + Hr-iJf- 1 , with polynomial operations performed modulo X r — 1, 
we can associate a rate (L — J) /L convolutional code C con v given by the same J x L polynomial 

parity-check matrix H conv (L») = (hji(D)) JxL = H^(D) = H + Hi£> H h H^iD^ 1 , where 

the change of variables indicates the lack of modulo D r — 1 operations [6]. Let the syndrome former 
memory m s < r — 1 of C conv be the largest integer in {0, 1, . . . , r — 1} such that H ms ^ 0. Then 
the polynomial parity-check matrix H conv (D) has the following scalar description (see, e.g., [18]) 



H r 



H 
Hi 
H 2 



H 
Hi 



Hr 



H r _i H r _2 H r _3 
H r _i H r _2 
H r _i 



H 
Hi 
H 2 



H 
Hi 



H r 



H n 



H ms H ms _i H ms _2 



H ms — l 
H r „. 



(Note that H 

conv is a semi-infinite parity-check matrix.) 

Example 2.3. Unwrapping the code in Example 12.11 we obtain a convolutional code with poly- 
nomial parity-check matrix H conv given by 



H r 



,{D) 



1 

D 2 




1 
1 D 
1 1 



(2.3) 



The syndrome former memory is equal to 2 and the decomposition leading to the scalar matrix 
H C onv is H conv = Ho + HiD + H2-D 2 , where Ho, Hi, H2 are the same matrices as in (j2.2p , 6 □ 

(r) 

In the same way as we obtained C con v from Cq^, we can, upon choosing a positive integer 
r, obtain a code Cq^(X) from a given code C CO nv(-D)- 7 It easily follows that, similar to obtain- 
ing H conv by unwrapping Hqq, Hq^ is obtained from H conv by suitably wrapping H conv . This 
unwrapping/ wrapping can also be observed in the Tanner graph representations of Hq^,(X) and 
H C onv (D), as illustrated in Fig. EJ 

(r) 

We turn now our attention to codewords in these codes. Codewords in Cq^ will be denoted by 
a polynomial vector like c(X) and codewords in Cconv will be denoted by a polynomial vector like 
c(D) (when the code is represented by a scalar parity-check matrix H con v; then it will be denoted 
by a vector like c). (We will later use a similar notation for pseudo-codewords.) 

6 Note that this code is trivially zero. 

7 In this paper, unless specifically stated otherwise, we will consider the case r — 1 > m s , i.e., r > m s + 1. Some 
of the results can be stated for an arbitrary r. 



8 



Figure 4: Left: The finite Tanner graph of a QC code. Right: The infinite Tanner graph of an 
unwrapped convolutional code. 



For any non-zero codeword c(D) with finite support in the convolutional code, its r wrap- 
around, defined by the vector c(X) mod (X r — 1) <G (F2IX}/ (X r — 1)) L , is a codeword in the 

associated QC-code, since H^(X) • c(X) = in (W 2 [X]/(X r - l)) L . 

In addition, the Hamming weight of the two codewords is linked by the following inequality 
wn(c(X) mod (X r — 1)) < wu(c(D)), which gives the inequality [9,10] (we assume all codes are 
non-trivial) 

drain ( C Qc) < ^frcc (Qonv) for all T > m s + 1. 

Moreover, we have the following inequalities. 
Lemma 2.4. 

drain ( C Qc) < ^min ( C QC ) < ^min (CqC ) < ' ' ' for all r > m s + 1, 

and 

r-+oo < ^ m ' n (^"Qc) = ^frcc (Cconv) • 

Proof: Let H { ^}(X) be a parity-check matrix of C§§ and HqJ(X) = H%$(X) mod (X r - 1) 

be the corresponding parity-check matrix of Cqq. Let c(X) be a nonzero codeword in Cqq of 

weight equal to the minimum distance d m ia (Cqc ) ■ ^ e ^ ave ^QC ' c (^) = m (^2[X]/ (X 2r — 

1)) L and since X r - l\X 2r - 1, it follows that hJJ(X) • c(X) = in (F 2 [X]/(X r - 1}) L . 

If c(X) mod (X r — 1) ^ then we obtain the inequality drain (Cqc) — ^min (Cqc ) ■ ^ c (^0 = 
OmodpT - 1) we can write c(X) = (X r - l)ci(X) with ci(X) ^ Omod(X r - 1) (otherwise 
c(X) = Omod(X 2r - 1).) Hence H^(X) • (X r - 1) • Cl (X) = Omod(X 2r - 1) or equivalently, 

H^ C (X) ■ Cl (X) = Omod(X r - 1), which implies that a(X) is a nonzero codeword in Cqq- The 
Hamming weight d min (cgj) = w K (c{X)) = w n ({X r - l)ci(X)) > 2 • w K (a{X) mod (X r - 
1)) > wh_(ci(X) mod (X r — 1)) > d m ia (Cqc) • Mimicking this proof we also obtain df rcc (C con v) > 
rfmin (Cqc) ' an d hence the desired inequality 

dmia ( C Qc) ^ d min ( C Qc) ^ d min ( C Qc) ^ ' " " ^ d &cc (Cconv) for all r > m s + 1. 

From the way we construct the semi-infinite sliding matrix of the convolutional by unwrapping 
the scalar parity-check matrix of the QC block code versions, we can see that there exists a QC 
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code of circulant size r large enough, so that its minimum distance is equal to the free distance of 
the convolutional code. This assures the limit equality above. □ 
If we denote by Q = Q^ r \ Q^ 2r \ Q^ r \ ■ ■ ■ the Tanner graphs associated with the parity-check 
matrices of Cqq, Cq^ , Cq^ , ... of increasing size (which are associated with the same polynomial 
matrix H(D)), it is easy to see that we obtain a tower of covers of the graph Q. The above 
relationship between the minimum distances of the codes in the tower is then easily verified in 
graph language, since a codeword c(D) in a larger graph, say Cq^ , when projected onto the 

graphs of Cq^ and Cq^ using the formula c(X) mod (X r — 1), respectively, c(X) mod (X 2r — 1), 
gives another codeword. Finally, the graph of the associated convolutional code is an infinite cover 
of each of the graphs in the tower. 

Example 2.5. Using the graphs associated with the trivial code C = {(0, 0, 0)} and its cubic cover 
C*- 3 ** having parity-check matrix Hq^, (see Example 12, ljl corresponding to the codeword (0,0,0) in 
C, we can identify the codeword c = (1:1:0, 1:1:0, 1:1:0) G Nullsp(H^) in C( 3 ). The polynomial 
description of c is c(X) = (1 + X, 1 + X, 1 + X ) E Nullsp(H^(X)). The wrap-around of c(X) 

mod(X-l), c(X) mod (X - 1) e (¥ 2 [X]/(X r - l)) 3 ,r = 1, gives the codeword (0,0,0) in C. The 
associated convolutional code is trivially zero. □ 



2.2 The Fundamental Cone of the Parity-Check Matrices of QC and Convolu- 
tional Codes 

In this section we introduce our main object of study, the fundamental cone of a matrix, a set that 
contains all relevant pseudo-codewords [2-4,14-16]. 

Definition 2.6 (see [2, 3, 15, 16]). Let H be a binary matrix of size m x n, let X = 1(H) = 
{0, . . . , n — 1} be the set of column indices of H, and let J = ^(H) = {0, . . . , m — 1} be the set 
of row indices of H, respectively. For each j € J ', we let Xj = Xj(H) = {i € X \ hji = 1} be the 
support of the j-th row of H. The fundamental polytope V = V(H) of H is then defined as [2,3] 

m 

V=f) ConvexHull(Cj) with Q = {c£ {0, l} n r iC T = mod 2 } , 

where Tj is the j-th row of H, and ConvexHull(C/) is the convex hull of C,-, defined as the set of 
convex combinations of points in Cj when seen as points in M n . The fundamental cone /C(H) of H 
is defined as the conic hull of the fundamental polytope V, which includes the vertex 0, stretched 
to infinity. Note that if a; € /C(H), then also a ■ u € /C(H) for any real a > 0. Moreover, for any 
u € /C(H), there exists an a > (in fact, a whole interval of a's) such that a ■ u £ "P(H). □ 

Vectors in "P(H) are called pseudo-codewords of H, and we will call any vector in /C(H) a 
pseudo-codeword and two pseudo-codewords that are equal up to a positive scaling constant will 
be considered to be equivalent. Clearly, not all pseudo-codewords are codewords, but all codewords 
are pseudo-codewords. 

The fundamental cone and polytope can both be described by certain sets of inequalities that 
are computationally useful [3,16]. Since our study of pseudo-codewords will rely heavily on the 
fundamental cone, we now describe it in more detail. 
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Figure 5: The fundamental polytope for H = [1 1 1]. 



A vector cj = (cji, . . . , u n ) G W 1 is in the fundamental cone K = /C(H) if and only if 

Ui > for all i G X(H), 
u>i > Wj/ for all j G i7(H) and for all i' G 2j(H). 

i6(3i\{*'}) 



(2.4) 
(2.5) 



Example 2.7. Let H = [1, 1, 1] be a parity-check matrix for the code C = {(0,0,0), (1,1,0), 
(1, 0, 1), (0, 1, 1)}. The fundamental polytope "P(H) associated with H is (see Fig. 0) 



< LOl < 1 
< W 2 < 1 

< w 3 < 1 



— Wl + UJ 2 + U>3 > 

+UJi - Ll>2 + U>3 > 

+ W2 - W 3 > 

+0>1 + Ll>2 + ^3 < 2 



> • 



from which we obtain the fundamental cone /C(H) associated with H: 



/C(H) 



(UJI,U)2,U> 3 ) G 



< Wl — + W2 + <-^3 > 

< U>2 +0J\ — U>2 + <^3 > 
< LU3 +UJ\ + U>2 — 0^3 > 



We see that the pseudo-codeword u) = (|, |, |) satisfies the inequalities of both the fundamental 
polytope and the fundamental cone. Also, the pseudo-codeword u = (1, 1, 1) and its positive 
multiples are in the fundamental cone but not necessarily in the fundamental polytope. □ 

Remark 2.8. As explained briefly in Sec.Q pseudo-codewords can also be described using graph 
theory language as vectors corresponding to codewords in a cover graph. Looking again at the 
cover tower Q = Q^ r \ Q^ 2r \ Q^ r \ a codeword c(D) of a larger graph, say Cqc, seen as a 
vector with real components, when projected over R onto the graphs of Cq^ and Cqq , using the 
formula c(X) mod (X' r — 1), respectively, c(X) mod (X 2r — 1), gives a pseudo-codeword. Similarly, a 
codeword c(D) of the unwrapped convolutional code C con v) when projected onto the graphs of Cqq 

r (2r) r (4r) 
-QC ' ^QC 



, using the formulas c(X) mod (X r - 1) , c(X) mod (X 2r - 1), c(X) mod {X Ar - 1 



gives pseudo-codewords in the QC codes. 

Example 2.9. Let c = (1:1:0, 1:1:0, 1:1:0) G Nullsp(H^) be the codeword identified in Exam- 
ple l2.5l for the code with corresponding polynomial description c(X) = (1+X, 1 + X, 1 + X) G 
Nullsp(H^(X)). When projecting c(X) over R onto the graph of C = {(0,0,0)} we obtain 
c(X) mod (X — 1) = (2, 2, 2), which is a pseudo-codeword in C, equivalent to the pseudo-codeword 
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u: = (|, |, I) presented in Example ll.il The fundamental cone of the convolutional code given by 
H CO nv(-t>) in Example 12,31 is trivially zero, i.e., it does not contain any non-zero pseudo-codewords. 

□ 

Remark 2.10. In the above example we made the following important connection between two 
graphical ways of representing pseudo-codewords ld of a QC code C of length rL as vectors corre- 
sponding to codewords c(X) in some QC code (or convolutional code) associated with an m-cover 
(or an infinite cover) of the graph of C. We can see them as vectors with each entry represent- 
ing an average of the ones attributed to the variable nodes of the cover graph that are in the 
"cloud" of that entry, e. g., u = (|, |, |), corresponding to pseudo-codewords in the polytope of 
C, or we can see them as vectors that are the projections of c(X) onto the code C by the formula 
c(X) mod {X r — 1), e.g., u = (2, 2, 2), corresponding to pseudo-codewords in the fundamental cone 
of C that are not necessarily in the polytope. 

In fact a similar statement can be made about pseudo-codewords in any block code C of length 
rL, not having necessarily a QC structure. A pseudo-codeword u: in C is a vector corresponding 
to some codeword c in some block code (or convolutional code) associated with an m-cover (or 
an infinite cover) of the graph of C with each entry representing an average of the ones attributed 
to the variable nodes of the cover graph that are in the "cloud" of that entry. However, this is 
the same, under the equivalence between different representations of a pseudo-codeword in the 
fundamental cone, as defining a pseudo-codeword w as a vector for which there exists a cover 
corresponding to a block code or to a convolutional code, C, of the code C, and there exists some 
codeword, c in C, such that u> = cmod (rL) where the vector c is equal to a permutation of the 
vector c (where the components of c are taken in the following order 1, m + 1, . . . , (L — l)m + 
1, 2, m + 2, . . . , (L — l)m + 2, . . . , m — 1, 2m — 1, ... , Lm — 1). The vector c it is seen as a vector 
with real entries and the modular operation mod (rL) is performed over the real numbers. 

The above description of the fundamental polytope and cone provides computationally easy 
descriptions of the pseudo-codewords, especially in the case of QC and convolutional codes. 
From (J2.4j) and (|2.5f) it follows that for any parity-check matrix H there is a matrix K such 
that a vector u is a pseudo-codeword in the fundamental code /C(H) if and only if K • uj t > T . 
In the case of QC and convolutional codes, the fundamental cone can be described with the help 
of polynomial matrices. Namely, for any Hqq(X) there is a polynomial matrix Kq^,(X) such that 

u{X) G fc(H^Lpf)) if and only if K^,(X) • u(X) T > T . Similarly, for any H conv (D) there is a 
polynomial matrix K conv (D) such that u(D) G /C(H conv (D)) if and only if K conv (D) ■u(D) T > T . 

This fundamental-cone description is particularly simple and hence useful in the case of mono- 
mial parity-check matrices H conv (see [6]), since the fundamental-cone inequalities can then be 
stated very simply as follows. A vector a; is a pseudo-codeword in the fundamental cone of a mono- 
mial matrix H conv if and only if the associated polynomial vector us(D) = (o;i(D), L02{D), . . . , u>l(D)) 
satisfies: 

Ui(D)>0, for all t G {1,... ,L}, 
hje(D)u e (D) > h je >(D)uj e/ (D), for all j G {1, . . . , J}, for all £' G {1, . . . ,L}. 

ee{i,...,L}\m 

Equivalently, if f2(L>) is an L x L matrix with ^-th row vector 

••• +ut-i(D) -u e (D) +cj i+1 (D) ■■■ co L (D)) 
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for j € {1, ... , L}, then uj(D) is a pseudo-codeword if and only if uji(D) > for all £ £ {1, . . . ,L} 
and 



,{D) ■ n(D) T > 0. 



Example 2.11. Let 



r(D) 



11 1 1 ' 

\ D D 2 D 3 
1 L> 4 L> 3 L> 2 



be a polynomial parity-check matrix of a rate-1/4 convolutional code. The following vector 

w(D) = (3D 2 + D 3 AD + D 2 3 + D + AD 2 + D 3 3 + AD + L> 2 ) 

= (0 3 3) + (0 4 1 4) D + (3 1 4 l) D 2 + (l 1 0) D 3 , 

corresponding to the scalar vector 

57 =(0,0, 3, 3, 0,4,1,4, 3,1,4,1, 1,0,1,0, •••), 

is a pseudo-codeword for the convolutional code since u>(D) > and the matrix 



n(D) 



-3D 2 -D 3 +AD + D 2 +3 + D + AD 2 + D 3 +3 + AD + D 2 ' 

+3D 2 + D 3 -AD-D 2 +3 + D + AD 2 + D 3 +3 + AD + D 2 

+3D 2 + L> 3 +4D + D 2 -3-D -AD 2 - D 3 +3 + AD + D 2 

+3L> 2 + L> 3 +4D + D 2 +3 + £> + 4L> 2 + D 3 -3 - AD - D 2 



gives 



H conv (£>) ■ n(D) T 



6 + 9D + 3D 2 AD 2 
6 + D + D 2 + 2D 3 2D 2 
7D + D 2 
D + 7D 2 + 2D 3 



AD 3 + 8D 4 + 2D 5 6D 3 + 2D 4 + 8D 5 + 2D 6 



AD 3 + 8D 4 + 2D 5 6D 2 + 8D 3 
AD 2 + AD 3 6D 2 
10D 2 8D 5 



2D 4 
2D 3 
2D 6 



> 0, 



i.e., H conv (D) • Q(D) T has all polynomial entries with non-negative coefficients. 



2D b 



□ 



Similarly, for a QC code that is described by a monomial J x L parity-check matrix Hq^(X), 
a vector u>(X) is a pseudo-codeword in the corresponding fundamental cone if and only if the 
associated polynomial vector u>(X) = (oji(X),U2(X), . . . ,u>l{X)^ satisfies 



(uj e (X) mod (X r - 1)) > for all £ € {1, . . . , L}, 



and 



U^ ] c (X)n(X) T mod (X r - 1)) > 0. 

Example 2.12. Let Hq|,(X) be the r = 5 parity-check matrix obtained from H conv (D) in Ex- 
ample for a QC block code of length n = 20. Then, for the polynomial vector 

u {5 \X) = (3X 2 + X 3 AX + X 2 3 + X + AX 2 + X 3 3 + AX + X 2 ), 
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we obtain 



H^(X)fi(X) T mod(X 5 -l) 

6 + 9X + 3X 2 2 + 4X 2 + 4X 3 + 8X 4 8 + 2X + 6X 3 + 2X 4 n 1 

6 + X + X 2 + 2X 3 2 + 2X 2 + AX 3 + 8X 4 2X + 6X 2 + 8X 3 + 2X 4 
7X + X 2 AX 2 + AX 3 6X 2 + 2X 3 

X + 7X 2 + 2X 3 WX 2 8 + 2X 

Since u^ 5 \X) > and (H§l(X)fi(X) T mod (X 5 - 1)) > 0, we know that u^(X) is a pseudo- 
codeword. □ 

The pseudo-codeword u(D) = (3D 2 + D 3 , AD + D 2 , 3 + D + AD 2 + D 3 , 3 + AD + L> 2 ) in 
Example 12.111 was obtained by using a method to be described in Sec.@J Projecting this pseudo- 
codeword in the convolutional code onto the r-wrapped QC code, for r = 5, we obtained the 
pseudo-codeword u(X) = (3X 2 + X 3 , AX + X 2 , 3 + X + AX 2 + X 3 , 3 + AX + X 2 ) in the QC 
code. This is not a mere coincidence as the following lemma shows. 

Lemma 2.13. Let uj(D) be a pseudo-codeword in the convolutional code defined by H conv (L>), 
i.e., u(D) £ /C(H conv (Z))). Then its r wrap-around polynomial vector is a pseudo- codeword in the 
associated QC-code defined by H^(X), i.e., cj(X)mod(X r — 1) £ £(H^(X)). 

Proof: We have seen that for any H conv (L>) describing a convolutional code there is a matrix 
K CO nv(-t>) such that a polynomial vector u(D) is a pseudo-codeword in the fundamental cone 
/C(H conv (D)) if and only if K conv (D)u(D) T > T . By reducing K conv(-C) modulo D r — 1, we obtain 
a matrix K^ C (X) = (K 

conv (X) mod (X r — 1)) with the property that a polynomial vector u>(X) 

is a pseudo-codeword in the fundamental cone /C(H^(X)) if and only if K { ^1(X)lj(X) t > T . 

Reducing K conv (D)u(D) T > T modulo D r - 1, we obtain K^ ] c (X)(u(X) mod (X r - 1)) T > T , 
which proves the claim. □ 

Remark 2.14. This result can be easily deduced using the graph-theory language at the end of 
Sec. 12.11 Indeed, looking again at the cover tower Q = ^ r ',5' 2f ',^ 4r v--i a pseudo-codeword 
u)(D) of a larger graph, say Cqq , when projected onto the graphs of Cq£ and Cq£? , using the 
formula u>(X)mod(X r — 1), respectively, u(X)mod(X 2r — 1), gives another pseudo-codeword. 
Similarly, a pseudo-codeword uj(D) of the unwrapped convolutional code C con v> when projected 
onto the graphs of Cqq Cq£ f , Cqq , . . . , using the formulas mod (X r — 1), u;(X) mod (X 2r — 1), 
u>pf) mod (X 4r — 1), ... , gives pseudo-codewords in the QC codes. 

Example 2.15. The pseudo-codeword 

u(D) = (3D 2 + D 3 AD + D 2 3 + D + AD 2 + D 3 3 + AD + L> 2 ) 

of the convolutional code in Example 12.111 projects onto the following pseudo-codewords in the 
QC wrapped block code versions of parity-check matrix Hq^(^\T) — H conv (X)modpr - 1), for 



14 



all r > 0: 8 

u {r) (X) = u{D) mod (X r - 1) = (3X 2 + X 3 AX + X 2 3 + X + 4X 2 + X 3 3 + 4X + A 2 ) , 

for all r > 4, (r = 5 case above is included, ) 
cj (3) (X) = u(D) mod (X 3 - 1) = (l + 3A 2 4X + X 2 A + X + AX 2 3 + 4A + A 2 ) , 
uj {2 \X) =u(D)mod(X 2 -1) = (3 + X 1 + AX 7 + 2X A + AX), 
cj (1) (A) = u(D)mod(X - 1) =(4 5 9 8). 

□ 

3 Pseudo- Weight Comparison Between QC Codes and Convolu- 
tional Codes 

This section begins with an introductory subsection in which various channel pseudo-weights are 
defined and continues with the main result that the minimum AWGNC, BEC, BSC, and max- 
fractional pseudo- weights of a convolutional code are at least as large as the corresponding pseudo- 
weights of a wrapped QC block code. 

3.1 Definitions of Channel Pseudo- Weights 

Definition 3.1. [2-5,15,16] Let u> = (w , ...,u n -i) be a nonzero vector in R™. The AWGNC 
pseudo-weight and the BEC pseudo-weight of the vector oj are defined to be, respectively, 

AWGNC (w) A J« > BEC (uj) = | 

1Mb 

where ||w||i and ||u;||2 are the 1-norm, respectively 2-norm, of cj, and supp(o^) is the set of all 
indices i corresponding to nonzero components U{ of uj. In order to define the BSC pseudo- weight 
Wp SC (u), we let oj' be the vector of length n with the same components as u but in non-increasing 
order. Now let 

m)="'i («<£<* + l, 0<£<n), F(0=p(O^> and e 4 F -i ^*MY 

Then the binary symmetric channel (BSC)-pseudo- weight Wp SC (u) is Wp (oj) = 2e. Finally, the 
fractional weight of a vector u € [0, l] n and the max-fractional weights of a vector u; E R™ are 
defined to be, respectively, 

U>frac(<*>) = |M|l, 'Wmax-frac(^) A 



,M|oo 

where ||^||oo is the infinite or max norm. For u: = we define all of the above pseudo- weights, 
fractional weights, and max-fractional weights to be zero. □ 

A discussion of the motivation and significance of these definitions can be found in [3]. Note 
that whereas the fractional weight has an operational meaning only for vertices of the fundamental 
polytope, the other measures have an operational meaning for any vector in the fundamental 
polytope or cone. Note also that here Wf Tac and u> m ax-frac are defined for any vector in R", 
whereas ^f rac and i«max-frac in [15] are what we will call w™™ and w^x-frac- 

8 If the convolutional code has a monomial polynomial parity-check matrix then we do not need the earlier 
assumption r > m s + 1, as Lemma 12. 131 holds for each r > 1. 
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Example 3.2. Let 

u)(D) = (3D 2 + D 3 4D + D 2 3 + D + 4D 2 + D 3 3 + 4D + D 2 ) 

be the pseudo-codeword in Example 12.111 and let u; = (0,0,3,3, 0,4,1,4, 3,1,4,1, 1,0,1,0, 
0, 0, 0, 0, . . .) be its scalar vector description. Then 



., AWGNC /. ,\ _ ll^lll _ V° " ^"f ° " ° "T- J • _ o /ft; „..BEC 



In order to compute Wp SC (a;), we let a/ = (4, 4, 4, 3, 3, 3, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, . . .) be the vector 
that lists the components of u> in non-increasing order. We obtain Wp SC (u) = 2e = ^ = 6.67, 
since we need to add up e = j ordered components of u' to obtain J^Ui = 13. Finally ||w||oo = 
max^^i, from which it follows that ui max _f rac (a;) = jjil^jp" = T = ^ 

A measure of the effect that the pseudo-codewords have on the performance of a code is given 
by the minimum pseudo-weight [2,3, 15, 16] 

w min (H) A mm A,) 

p ^eV(P(H))\{o} 

where V(V(H)) \ {0} is the set of all non-zero vertices in the fundamental polytope V(H) and 
the pseudo-weights are the appropriate ones for each channel (AWGNC, BSC, and BEC pseudo- 
weights) and the minimum fractional and max-fractional weights. 

Computing these values can be quite challenging, since the task of finding the set of vertices 
of V(H) is in general very complex. However, in the case of four of the above pseudo-weights 
(the minimum AWGNC, BSC, and BEC pseudo-weights and the minimum max-fractional weight) 
there is a computationally simpler description given by 

wf n CH) = min wJw), 

for the appropriate pseudo- weight. (Note that there is no such statement for the minimum frac- 
tional weight; see, e.g., [3,16]). 



E? 



xample 3.3. Let Hqq(X) be the matrix of Example 12 . 1 21 and let Cqq be the QC block code of 

length n = 20, with r = 5. The minimum distance is 6 and it is equal to the u>p WGNC ' mm (HQQ). 
(This was obtained using a vertex enumeration program for polytopes that lists all the minimal 
pseudo-codewords of a code, see [21].) □ 

Remark 3.4. In [3] it was shown that for any code defined by a parity-check matrix H, the 
following inequalities hold: 



<£(H) < <i n x _ frac (H)< ^WGNC,min (H) < w BEC, m in (H); 

<£(H) < <i n x _ frac (H) < ^ sc < min (H) < u,f c < min (H). 



Therefore, w™™(H.) and 

^max— frac(^) can serve as lower bounds for uip ' (H), Wp ' (H), 

j BEC, min /ttx 

and w p (ti). 
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3.2 Minimum Pseudo- Weights 

In what follows, we compare the minimum pseudo-weights and minimum max-fractional weight 
of a QC block code to those of its corresponding convolutional code which we assume to have a 
fundamental cone containing non-zero vectors. In order to analyze the minimum pseudo-weight 
and minimum max-fractional weight, it is sufficient to analyze the weights of the non-zero vectors 
in the fundamental cone. Throughout this section, without loss of generality, all pseudo-codewords 
cj(D) are assumed to have finite support. 9 

Theorem 3.5. For the AWGNC, BEC, and BSC pseudo-weights, if u(D) G /C(H conv (£>)), then 

w p (u(X)mod{X r - 1)) < k; p (u>(.D)). 

Therefore, if the fundamental cone of the convolutional code is not trivial (i.e., it contains non-zero 
vectors) we obtain 

< n (HW(X))<<"(H conv (D)). 

Proof: In the following, we need to analyze separately the AWGNC, BEC, and BSC pseudo- 
weights of oj{D) and of its r wrap-around u)(X) mod (X r — 1). Let u}(D) = (u)i{D), . . . , ujl(D)) 
be a pseudo-codeword. By assumption u(D) has finite support, i.e., there exists an integer t such 
that the maximal degree of any u>£(D), I G {1, . . . , L}, is smaller than t. 

Case 1 (AWGNC): Since ||u>(D)||i = \\u(X) mod (X r - l)||i and 

L r-l /L(*-l)/rJ \ 2 L r—1 L(«-l)/rJ 

1=1 i=0 y i'=0 J £=1 i=0 i'=0 

we obtain u£ WGNC (w(Z>)) > u£ WGNC (u;(X) mod (A r - 1)). 

Case 2 (BEC): Since the components of the vector i*)(X) mod (X r — 1) are obtained by adding 
in M + certain non- negative components of w(-D), it follows that 

|suppcj(L>)| > |supp(cj(X)mod(A r - 1)) | 

and we obtain w® EC (u(D)) > w® EC (u(X) mod (X r - 1)). 

Case 3 (BSC): In order to compare the BSC-pseudo- weight of the two vectors, we first need to 
arrange the components in decreasing order. Let Mq > M\ > . . . > M t L-\ and iuq > mi > . . . > 
m r L-\ be listings of all the coefficients of all the components of u(D) and u(X)mod(X r — 1), 
respectively, in non-increasing order. Since ||oj(D)||i = ||cu(A) mod (X r — l)||i, we obtain that 

M^k = MX) mod (^-1)11! A M) wMch giveg Jj-1 M . = ^rL-l^ = 2M _ Hence ^ ^ 

sequences of non-negative integers form two partitions, A and fi, respectively, of 2M. We fill the 
shorter partition with zeros so that both partitions have the same length, say P. It is enough to 
show that ^21=q Mi < YTi=o m i ^ or an ^' = 1' ^, . . . , P, i.e., that /U majorizes A [19]. 

We show first that tuq > Mq. Suppose the contrary, i.e., tuq < Mo. Since mj < niQ for all 
i G {0, . . . , P - 1}, we obtain that m, < M for all i G {0, . . . , P - 1}. But m u i G {0, . . . , P - 1}, 
was obtained by adding over M.+ a certain subset of the set {M v \ i' G {0, . . . P - 1}}. So there 
should be at least one r?v that has Mq in its composition, and hence r?v > Mo. This is a 
contradiction, from which we obtain tjiq > Mq. 

9 With suitable modifications, this can easily be generalized to u(D ) with ||w(.D)||i < oo. Note that such 
polynomial vectors also fulfill ||o;(D)||2 < oo. 
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We finish the proof by induction. Namely, we want to show that from Yll=o Mi < Yll=o m i f° r 
some i' G {1, . . . P — 1}, it follows that Ya=o Mi < Yll=o m «- ^ — m «' then this induction step 
clearly holds. So, assume that My > my. Since mp_i < . . . < m,;/ < Mj/ < M i /_ 1 < . . . < Mq, we 
can deduce that rrij/ , and in fact all nii with i' < i < P — 1 , cannot contain any Mj with < i < i' 
in its composition. Hence all possible Mi, < i < i', have occurred in the composition of mi for 
< % < i' — 1, which gives X)i=o m * — S^o 1 m * — Si=o This proves that /x majorizes A and 
we obtain 



BSC ( u{D)) > w® sc (u{X) mod (X r - 1)) . 



□ 

Theorem l3.5l implies that low-pseudo-weight vectors in the block code may correspond to higher 
pseudo-weight vectors in the convolutional code, but the opposite is not possible. This suggests 
that the pseudo-codewords in the block code that result in decoding failures may not cause such 
failures in the convolutional code, thereby resulting in improved performance for the convolutional 
code at low-to-moderate signal-to-noise ratios (SNRs). 

A similar bound also holds for the max-fractional weight, as shown in the next theorem. 

Theorem 3.6. If u{D) G /C(H conv (L>)) ; then 

^max-fracM^) mod (X T - 1)) < 

'"max- frac 



^max-frac (HqcPO) — ^max-frac (H CO nv {D)) ■ 



Therefore, 

„„min ( t 

l QC 

Proof: We have ||w(D)||i = \\w(X) mod (X r - l)||i and 

L(*-l)/rJ 

I I v\ A ( VT -\\\\ L r-1 L r-1 [(t-l)/r\ ,. , 

cJlAlmodfA —1) = max max > w> i+;' r > max max max Ufi+i' r > \\u)(D)\ 

'Moo £=1 j = Q Z — ✓ >^ £ =1 j = Q j, =Q > T 

i'=0 



which leads to w max _f mc (a;(X) mod (X r — 1)) < ii; max _f rac (u;(Z))). It now follows that 

l QCi 



^max-fracl-n-OC/ — u7 max-fracV- tl conv / /- 



□ 

In the case of the fractional weight, it is easy to see that for any uj(D) G V(V(H conv (D))) \ {0}, 
we have ||a;(D)||i = ||^(AT) mod (X r — l)||i, and hence Wf rac (uj(X) mod (X r — 1)) = u>f rac (u;(.D)). 
When comparing the minimum fractional weight of the convolutional and QC codes, we encounter 
a computatinally harder these values must be computed over the set of nonzero pseudo- 

codewords that are vertices of the fundamental polytope. This is not an easy task, because a 
vertex pseudo-codeword in the convolutional code might not map into a vertex pseudo-codeword 
in the QC code. 

The theorem below, however, can be established. For its better understanding, we recall that 
%mc( H Qc) has the following meaning [15, 16]. Let £§l C T(H^) be the set of posit ions where 
bit flips occurred when using Cqq for data transmission over a BSC with crossover probability p, 

< p < 1/2. If \£qq\ < ^w™™(Hqq), then LP decoding is correct. Similarly, w™™(H conv ) implies 
the following. If £ conv C I(H conv ) is the set of positions where bit flips occurred when using C con v 
for data transmission over a BSC, then |£ C onv| < ^fr a c(H conv ) guarantees that LP decoding is 
correct. 
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Table 1: The pseudo- weights of the pseudo-codewords in Example 12.151 





...AWGNC 
p 






'"max- frac 


Wfrac 


u>(D) 


8.45 


11 


6.67 


6.5 


26 


u>(X)mod (X r - 1) 
for all r > 4 


8.45 


11 


6.67 


6.5 


26 


u(X) mod (X 3 - 1) 


7.86 


10 


6.5 


6.5 


26 


u(X) mod {X 2 - 1) 


6.09 


8 


5 


3.71 


26 


u(X)mod {X - 1) 


3.63 


4 


3 


2.89 


26 



Theorem 3.7. Assume that we are using C conv for data transmission over a BSC with cross- 
over probability p, where < p < 1/2, and that bit flips occur at positions £ CO nv Q ^(H cori v)- 
If l^convl < \ w {^(^qc)> then LP decoding is correct. (Note that on the right-hand side of the 
previous inequality we have wJ£^(Hqq) and not w^^(H conv ).) 

Proof: We know that 

W frac (^-Qc) — ^max-frac (^Qc) — ^max-frac (H C onv) , (3.1) 

where step (*) follows from [15,16] (see also [3]) and step (**) follows from Theorem 13.61 

Remember that fracG^Qc) ^ as ^ e following meaning [15,16]. Let £ C onv £ ^(H conv ) be 

the set of positions where the bit flips occured when using C conv for data transmission over a BSC. 

If |£conv| < ^mix-frac( H conv), then LP decoding is correct. 

Now, because the theorem statement assumes that |£ C onv| < l^fracO^Qc)' usm § (|3 . lf> we have 

|fconv| < ^mLx-frac( H conv) and so, according to the meaning of ^™x-frac( H conv), LP decoding is 

correct. □ 

Remark 3.8. As discussed at the end of Sec. 6 in [3], w™™(H) and ^max-frac(H) can gi ve > 
especially for long codes, quite conservative lower bounds on u>p SC ' mm (H). (E.g., the guarantees 
on the error correction capabilities of the LP decoder implied by w^°(H) and ^max-frac(H) are n °t 
good enough to prove the results in [20].) However, a positive fact about ^™™(H) and ^max-frac(H) 
is that there are polynomial-time algorithms that compute these two quantities [15,16]. 

Remark 3.9. It is not difficult to adapt Theorem I3.5b nd Theorem 13.61 such that conclusions 
similar to the ones in Theorems 13.51 and 13.61 can be drawn with respect to a QC block code with 
the same structure but a larger circulant size that is a multiple of r. In fact, most QC block codes 
with the same structure but a larger circulant size, even if not a multiple of r, behave according 
to Theorem 13.51 Using similar arguments to the ones in the proofs of these thorems we obtain 
the following more general inequalities that hold for the AWGNC, BEC, BSC max-fractional and 
fractional pseudo- weights. If u;(D) E /C(H conv (D)), then 

w p (u(X)mod(X r -l)) <w p (u(X)mod(X 2r -l)) < w p (u(X) mod (X 4r -1)) < . . . < w p (u(D)), 
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Figure 6: The performance of a rate R = 1/4 (3, 4)-regular LDPC convolutional code and three 
associated (3, 4)-regular QC-LDPC block codes. Note that the horizontal axis is E s /Nq and not 
the more common E^/Nq = (1/R) ■ E s /Nq. 

for all r > 1. In addition, for the AWGNC, BEC, BSC and max-fractional minimum pseudo- weights 
the following holds for any m > 1: 

w p (u(X) mod (X r - 1)) < w p (u(X) mod {X mr - 1)) . 

Example 3.10. To illustrate how the pseudo- weights of the pseudo-codeword in the convolutional 
code and its projections onto the QC versions satisfy the pseudo-weight inequalities in Remark l3.9l 
for all the defined pseudo- weights Wp WGNC , w? EC , Wp sc , u> m ax-frac and W{ rac , we computed the 
pseudo- weights of the pseudo-codewords in Example 12.151 Tabled contains these results. 

□ 

Next we exemplify some of the bounds on the minimum pseudo-weight of codes proved in this 
section. We take a tower of three QC codes together with their convolutional version and compute 
their minimum pseudo-weights which, according to the bounds derived above, form a sequence of 
increasing numbers, upper bounded by the minimum pseudo-weight of the convolutional version. 
However, due to the large code parameters, we were able to compute only the minimum pseudo- 
weight of the smaller code of length 20. For the other QC codes we used the methods of [2,3] to 
give lower and upper bounds. 

Example 3.11. Consider the (3, 4)-regular QC-LDPC code of length 4r given by the scalar parity- 
check matrix, or polynomial parity check matrix, respectively, 

11 1 1 " 

1 X X 2 X 3 . 
1 X 4 X 3 X 2 _ 

For r = 5 we obtain a [20, 7, 6] code with rate R = 0.35. By increasing r we obtain other QC 
codes. By taking powers of r we obtain a tower of QC codes whose graphs form a sequence of 



H 



(r) 
QC 



lo lo lo lo 
lo Ii I2 I3 
lo I4 I3 I2 
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Table 2: The minimum pseudo-weights of the codes Cqq, given by the parity-check matrix 
U ( qI(X), for r = 1,2,3,4. 



< n (c£i) 


AWGNC,min 


BEC,min 
Wp 


BSC,min 
Wp 


min 

max— trac 


r = 1 


2 


2 


2 


2 


r = 2 


2 


2 


2 


2 


r = 3 


4 


4 


4 


3 


r = 4 


4 


4 


4 


4 



covers over the Tanner graph of the [20, 7, 6] code. For r > 9, all the codes have minimum distance 
10, and hence the free distance of the associated rate R = 1/4 convolutional code is <if ree = 10, 
strictly larger than the minimum distance 6 of the [20, 7, 6] code. 

For comparison we simulated these codes together with the associated convolutional code. 
The results for an AWGN channel are given in Fig. HO For the [20, 7, 6] code we ran a vertex 
enumeration program [21] for polytopes that lists all the minimal pseudo-codewords and found that 
the minimum pseudo-weight of the [20, 7, 6] code is w AWGNC,mm _ 6.00. The larger parameters of 
the other three codes allowed us to only lower and upper bound the minimum pseudo-weights. 10 
For the [40, 12, 10] QC code, we have 6.05 < ^WGNC,min < gm and for the ^ 22; io] and the 

[160, 42, 10] codes, we obtained the bounds: 6.20 < Wp in < 9.09. 11 The slight increase m 

the lower bound from 6, which is the minimum weight of the [20, 7, 6] code, to 6.05 and 6.20 shows 
that the minimum AWGNC pseudo- weight of the [20, 7, 6] code is less than that of the [40, 12, 10], 
[80, 22, 10], and [160, 42, 10] codes. The slight increase in the lower bounds from 6.05 to 6.20 only 
suggests, but unfortunately gives no further evidence of, the existece of an increasing sequence of 
minimum AWGNC pseudo-weights for these codes, according to the results of this section. 

For completeness, we mention that the techniques in [15, 16] allow us to efficiently compute 
the minimum max-fractional weight for the above-mentioned codes: we obtain w™™_ frac = 4.67 
for the length-20 code, w™™_ frac = 5.31 for the length-40 code, w™™_ frac = 5.33 for the length-80 
code, and w™™__ frac = 5.33 for the length-160 code. Applying the results that were mentioned 
in Remark 13. 4| we see that these values yield weaker lower bounds on w ^ WGNC ' mm than the ones 
given in the previous paragraph. 

Using the same vertex enumeration program [21] we could compute as well the minimum 
pseudo- weights of the QC codes of length 4r given by the parity-check matrix Hqq(AT) in Exam- 
ple IH^I for r = 1,2,3,4. We present the results in Table HJ □ 



4 Analysis of Problematic Pseudo-Codewords in Convolutional 
Codes 

Studying pseudo-codewords of small pseudo- weight, and, in particular (since the minimum pseudo- 
weight is upper bounded by the minimum Hamming weight), studying pseudo-codewords of pseudo- 

10 The lower bounds are obtained by applying the techniques that were presented in [22]. 

11 Using some more sophisticated lower bounds from [22], one can actually show that 7.19 < w A WGNC . mm < 9.09 for 
the length-40 code. This implies that the lower bounds for the length-80 and length-160 codes can also be tightened. 
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weight smaller than the minimum Hamming weight, represents an important problem in the perfor- 
mance analysis of LDPC codes because it allows us to identify potential failures in MPI decoding. 

Upper and lower bounds on the minimum pseudo- weight of a convolutional code can be obtained 
by exploiting the "sliding" structure of its semi-infinite parity-check matrix H con v and some of its 
sub-matrices, which allows relatively easy computations by taking advantage of the increased 
sparseness compared to the corresponding parity-check matrix of an underlying QC code. On the 
one hand, this technique allows us to find certain existing low-weight pseudo-codewords, and on 
the other, it illustrates the advantage of using a convolutional code structure over a block code 
structure in pseudo-codeword analysis. In addition, similar to the possible increase in minimum 
distance expected when going from a QC code to its unwrapped convolutional version, we expect 
an increase in the minimum pseudo- weight of the convolutional code, leading to better performance 
compared to the original QC code. Our theoretical results and experimental observations point 
strongly in this direction. We now explain briefly our technique. 

Similar to associating with a convolutional code [23] an increasing sequence of column distances 
(di)i>o, and a decreasing sequence of row distances (dJ);>o, having the property that 

d c < d\ < ••• < d frcc < ••• < d\ < cf , 

we define two sequences of pseudo-weights that prove helpful in identifying the overall minimum 
pseudo- weight. 

We recall that an encoder polynomial generator matrix G conv (D) of a rate R = K/L convolu- 
tional code with encoder memory m e has associated with it a semi-infinite sliding generator matrix 
Gconv Let G^oni denote the jK x iL submatrix of G con v with rows indexed by the first j block 
rows of G conv and columns indexed by the first i block columns of G conv , and let 

d jti = min j w u (u • G^ni) u = (u , . . . , u,-_i) G ¥ J 2 K , u ^ j . 

The following two sequences of matrices 

Wl.l) p(2,2) Wlfl 
"conv i "conv i • • • > "conv) ■ ■ ■ > 



and 



^-(l,m c +l) -= T (2,m +2) -^{l,m e +l) 
"conv ' "conv ' • • • > "conv ' 



give us an increasing sequence of distances (g?[)/>o, d c t = dij, commonly called column distances, 
associated with the first sequence, and a decreasing sequence of distances (ti[);>o, d\ = dij +me , 
commonly called row distances, associated with the second sequence. The column distance is 
a "truncation" distance, i.e., it measures the minimum of the Hamming weights of the vectors 
of length / + 1 that constitute the first / + 1 components of some codeword with non-zero first 
component. The row distance d] is a "bounded codeword" distance, i.e., it measures the minimum 
of the Hamming weights of the codewords with non-zero first component and of length I + 1 or 
smaller, or equivalently, of polynomial degree I or smaller. The column distances and row distances 
represent valuable lower bounds and, respectively, upper bounds, on the free distance that become 
increasingly tight with increasing I, and, in the limit, become equal to the free distance. If similar 
sequences could be defined for pseudo-weights, they would prove helpful in identifying the overall 
minimum pseudo- weight. 

With this in mind, we define corresponding sequences of "truncated" pseudo-weights and 
"bounded pseudo-codeword" pseudo- weights. 
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Let H(D) be a polynomial parity-check matrix for a convolutional code C conv with syndrome 
former memory m s , and let H conv be its semi-infinite sliding parity-check matrix. Similar to the 

notation above, let H co ' nv be the jJ x iL submatrix of H conv with rows indexed by the first j 
block rows of H conv and columns indexed by the first i block rows of H conv . We will consider two 
sequences of such sub-matrices: 

Yj(ra s + l,l) Yj(m s +2,2) Tr( m s+M) 
n conv ' -"^conv ' ■ ■ ■ > -"-"-conv > ■ ■ ■ > 

in which H^™ 8 ^'^ is the (m s + l)J x IL submatrix of the (m s + k)J x kL matrix H^ v +fc '' : ' formed 
by its first IL columns, for all I < k, and 

Tt(1,1) Tt(2,2) 

n conv ' n conv ' • • • > n conv > • • • > 

in which H C Q nv is the IJ x IL submatrix of the kJ x kL matrix H co ^ v formed by its first IJ rows, 
for all I < k. 

In the first sequence, the first matrix that has with certainty a nonzero nullspace is jj(^^" me+1 ' me+1 ) ^ 
since there is a nonzero polynomial codeword of degree m e (associated with a scalar codeword of 
length (m e + Since jj(^^" me+1 ' me+1 ) [G^ n ^ c+1 ^] T = 0, these matrices act like parity-check 

matrices in computing the row distances, with H cc ^^ me+1+ '' me+1+ ^ giving the l-th row distance, 
for all I > 0. There might be nonzero nullspaces earlier in the sequence, so we denote by H^^'^ 
the first matrix with a non-zero nullspace. Obviously this would mean that H C onv is the first matrix 
in the second sequence Hconv, I > 0, to have a nonzero nullspace. Similarly, H^ v , Z > fi, will act 
like parity-check matrices in computing the Zth column distances. So by computing the nullspaces 
of these parity-check matrices we get upper and lower bounds on cZf ree for the convolutional code 
that are similar to the column and row distances defined from the generator matrix. 

We remark also that if we wrap the convolutional code modulo X r — 1, with r > m s + /j,, then 
the matrix ll^^v^'^ is a submatrix of the parity-check matrix Hqq of the QC code that remains 

unchanged after the wrapping. Hence a codeword of minimum weight for the matrix H con s v 
will be, if extended by zeros, a codeword in the QC code. If this codeword has weight equal to the 
minimum distance of the QC code, then the free distance of the convolutional code is equal to the 
weight of this codeword. The minimum distance of the QC code could be smaller, however, and 
in this case the free distance will only be upper bounded by the weight of this codeword and lower 
bounded by the minimum distance of the QC code. 

In what follows we will mimic the theory of row distances and column distances of a convo- 
lutional codes to bound the minimum pseudo-weight of the convolutional code. We obtain upper 
and lower bounds on the minimum pseudo-weight of the convolutional code as follows: 



„,,min / tt(1>1)\ ^ „,,min / tt( 2 ) 2 A ^ 
w p y^conv ) < w p y^conv ) < ' ' ' 

^ ...min m \ ^ ^ „,,min / Tf (m s +2,2)\ ...min /tt(™s+1,1)\ 

< w p (H conv j < • • • < w p [tl cony J < w p (^H conv J . 

As above, we will denote by H^^ 1 " 1 '^ and H^„ v \ respectively, the first matrices in the above 
sequences whose fundamental cones contain a non-zero vector. So by computing vectors in the 
fundamental cones of these parity-check matrices we get upper and lower bounds on tt;™ in for the 
convolutional code that are similar to the column and row distances defined from the generator 
matrix. 
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Example 4.1. The pseudo-codeword in Example 12.111 was obtained by attempting to compute 
small degree non-zero vectors in the fundamental cone of /C(H conv ) using the above technique. The 
first nonzero "row pseudo-weight" is 4, and the vector (3D 2 + D 3 , 4D + D 2 , 3 + D + 4D 2 + D 3 , 3 + 

4D + D ) is in the fundamental cone of H^£. Its AWGN pseudo -weight is 8.45, which is an upper 
bound on the minimum pseudo-weight of the convolutional code. The free distance of this code is 
10. The reduced pseudo-codeword (3X 2 + X 3 , 4X + X 2 , 3 + X + 4X 2 + X 3 , 3 + 4X + X 2 ) modulo 
mod (X r — 1), for r = 5, 10, 20, 40 has the same weight 8.45, larger than the minimum distance of 
the [20, 7, 6] code, which makes this pseudo-codeword irrelevant, but smaller than the minimum 
distances of the codes [40,12,10], [80,22,10], [160,42,10]. The upper bound u, AWGNC > min < g Qg 
in Example 13.111 therefore becomes y^WGNCmm < 3 45 based on this pseudo-codeword. □ 

This computational method has been applied successfully to larger codes as well. An example 
is the rate R = 2/5 LDPC-CC with syndrome former memory m s = 21 that was simulated 
in Fig. [21 The code was constructed by unwrapping a [155, 64] (3, 5)-regular QC-LDPC block 
code with minimum Hamming distance 20. The convolutional code has free distance 24, 12 which 
already suggests a possible performance improvement compared to the QC code. Following the 
approach described above, we constructed a class of pseudo-codewords for which we obtained a 
minimum pseudo-weight of 17.85. Thus this class of pseudo-codewords contains vectors of weight 
less than the free distance, which makes them relevant to the performance analysis of iterative 
decoding. Consequently, an upper bound on the minimum pseudo-weight of the convolutional 
code is 17.85, and, from the way we constructed this class of pseudo-codewords, we believe it is 
a very tight bound. Projecting this pseudo-codeword onto the QC codes obtained by wrapping 
the convolutional code gives upper bounds on the minimum pseudo-weight of these codes as well 
(in some cases tighter than the ones obtained using the methods of [2,3]). The upper bound 
in [2,3] for the minimum pseudo- weight of the [155,64] code is 16.4. These upper bounds together 
with our simulation curves suggest that the minimum pseudo-weight of the convolutional code is 
strictly greater than the minimum pseudo-weight of the [155,64] QC code. An evaluation of the 
exact values of the minimum pseudo-weight in this cases is not possible, however, due to the large 
complexity of such a task. Also note that if an upper bound on the minimum pseudo-weight of 
the convolutional code smaller than 16.4 could be found, it would decrease the upper bound on 
the minimum pseudo-weight of the [155, 64] (3, 5)-regular QC-LDPC block code as well. 



5 Conclusions 

For an LDPC convolutional code derived by unwrapping an LDPC-QC block code, we have shown 
that the free pseudo-weight of the convolutional code is at least as large as the minimum pseudo- 
weight of the underlying QC code. This result suggests that the pseudo-weight spectrum of the 
convolutional code is "thinner" than that of the block code. This difference in the weight spec- 
tra leads to improved BER performance at low-to-moderate SNRs for the convolutional code, a 
conclusion supported by the simulation results presented in Figs. |2 and El We also presented 
three methods of analysis for problematic pseudo-codewords, i.e., pseudo-codewords with small 
pseudo- weight. The first method introduces two sequences of "truncated" pseudo-weights and, 
respectively, "bounded pseudo-codeword" pseudo- weights, which lower and upper bound the min- 
imum pseudo-weight of the convolutional code, similar to the role that column distances and row 
distances play in bounding from below and above the free distance. The other two methods can 

12 The free distance of this convolutional code was obtained by R. Johannesson et. al. at Lund University, Dept. 
of Information Technology, using a program called BEAST. 
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be applied to any QC or convolutional code and consist of projecting codewords with small weight 
in QC codes or convolutional codes onto Tanner graph covers of the code. 
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